Semi-supervised Co-training Algorithm Based on Assisted Learning

نویسندگان

  • Hong-li Wang
  • Rong-yi Cui
چکیده

The classification performance of the learner is weakened when unlabeled examples are mislabeled during co-training process. A semisupervised co-training algorithm based on assisted learning (AR-Tri-training) was proposed. Firstly, the assisted learning strategy was presented, which is combined with rich information strategy for designing the assisted learner. Secondly, the evaluation factor was calculated, and noise was eliminated from unlabeled example set by using the assisted learner and the evaluation factor. Finally, three single learners were trained using labeled examples, wronglearning examples on validation set and less noise unlabeled examples. The experimental results on application to voice recognition indicate that AR-Tritraining can compensate for the Tri-training shortcomings and the average classification accuracy is increased by 15%. As can be drawn from the experimental results, AR-Tri-training not only removes the mislabeled examples in training process, but also takes full advantage of the unlabeled examples and wrong-learning examples on validation set.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Semi-Supervised Learning Algorithm Based on Modified Self-training SVM

In this paper, we first introduce some facts about semi-supervised learning and its often used methods such as generative mixture models, self-training, co-training and Transductive SVM and so on. Then we present a self-training semi-supervised SVM algorithm based on which we give out a modified algorithm. In order to demonstrate its validity and effectiveness, we carry out some experiments whi...

متن کامل

Combining Committee-Based Semi-supervised and Active Learning and Its Application to Handwritten Digits Recognition

Semi-supervised learning reduces the cost of labeling the training data of a supervised learning algorithm through using unlabeled data together with labeled data to improve the performance. Co-Training is a popular semi-supervised learning algorithm, that requires multiple redundant and independent sets of features (views). In many real-world application domains, this requirement can not be sa...

متن کامل

A Rough Set Method for Co-training Algorithm

In recent years, semi-supervised learning has been a hot research topic in machine learning area. Different from traditional supervised learning which learns only from labeled data; semi-supervised learning makes use of both labeled and unlabeled data for learning purpose. Co-training is a popular semi-supervised learning algorithm which assumes that each example is represented by two or more r...

متن کامل

Semi-Supervised Regression with Co-Training

In many practical machine learning and data mining applications, unlabeled training examples are readily available but labeled ones are fairly expensive to obtain. Therefore, semi-supervised learning algorithms such as co-training have attracted much attention. Previous research mainly focuses on semi-supervised classification. In this paper, a co-training style semi-supervised regression algor...

متن کامل

On Semi-Supervised Classification

A graph-based prior is proposed for parametric semi-supervised classification. The prior utilizes both labelled and unlabelled data; it also integrates features from multiple views of a given sample (e.g., multiple sensors), thus implementing a Bayesian form of co-training. An EM algorithm for training the classifier automatically adjusts the tradeoff between the contributions of: (a) the label...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011